Fault Tolerance

aws/concept

💡 Definition

Fault tolerance is a system's ability to keep operating without interruption when one or more of its components fail. A fault-tolerant design uses redundancy so that the failure of any single component does not cause a service outage.

🔑 Key Concepts

⚙️ How it Works

In AWS, fault tolerance is built into many managed services (e.g., S3 Standard is designed for high durability and stores objects redundantly across multiple Availability Zones). For your own applications, it is achieved by distributing resources across multiple AZs, using services that recover automatically (such as Auto Scaling replacing unhealthy EC2 instances), and designing with redundancy so that there is no single point of failure.
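
The multi-AZ and automatic-replacement ideas above can be illustrated with a small, self-contained simulation. This is only a sketch: `Fleet`, `Instance`, and `reconcile` are hypothetical names invented for illustration, not AWS APIs, and the balancing rule (launch into the least-populated available AZ) is a simplifying assumption rather than a description of how EC2 Auto Scaling actually places instances.

```python
from collections import Counter
from dataclasses import dataclass, field
from itertools import count


@dataclass
class Instance:
    instance_id: str
    az: str
    healthy: bool = True


@dataclass
class Fleet:
    """Toy model of an Auto Scaling-style group (hypothetical, not an AWS API)."""
    azs: list
    desired_capacity: int
    instances: list = field(default_factory=list)
    _ids: count = field(default_factory=lambda: count(1), repr=False)

    def _launch(self, available_azs):
        # Assumption: place the new instance in the available AZ
        # that currently holds the fewest instances.
        per_az = Counter(i.az for i in self.instances)
        az = min(available_azs, key=lambda a: per_az[a])
        self.instances.append(Instance(f"i-{next(self._ids):04d}", az))

    def reconcile(self, failed_azs=frozenset()):
        """Remove unhealthy instances, then launch replacements
        until the fleet is back at its desired capacity."""
        # Every instance in a failed AZ is treated as unhealthy.
        for inst in self.instances:
            if inst.az in failed_azs:
                inst.healthy = False
        self.instances = [i for i in self.instances if i.healthy]

        available = [a for a in self.azs if a not in failed_azs]
        if not available:
            raise RuntimeError("no Availability Zone left to launch into")
        while len(self.instances) < self.desired_capacity:
            self._launch(available)


if __name__ == "__main__":
    fleet = Fleet(azs=["us-east-1a", "us-east-1b", "us-east-1c"],
                  desired_capacity=6)
    fleet.reconcile()                            # initial launch across 3 AZs
    print(Counter(i.az for i in fleet.instances))
    fleet.reconcile(failed_azs={"us-east-1a"})   # simulate losing one AZ
    print(Counter(i.az for i in fleet.instances))
```

In this toy model, losing one AZ removes a third of the fleet, and the next `reconcile` call restores full capacity in the two surviving AZs. That is the core of the pattern: redundancy across AZs limits the blast radius, and automatic replacement restores capacity without manual intervention.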

🎯 Use Cases

💰 Pricing Model

📝 Exam Tips (CLF-C02)


See Also:
* High Availability
* AZ
* Shared Responsibility Model